Processor register recovery after flush operation

ABSTRACT

An information handling system includes a processor that may perform general purpose register recovery operations after an instruction flush operation that an exception, such as a branch misprediction causes. The processor receives an instruction stream that may include multiple instructions that operate on a particular target register that stores instruction result information. The general purpose register may temporarily store instruction opcode and register bits information for use during dispatch, execution and other operations. The processor includes a recovery buffer unit for use during flush recovery operations. The processor may use recovery valid and recovery pending bits that correspond with each instruction during the register recovery from flush operation.

BACKGROUND

The disclosures herein relate generally to processors, and morespecifically, to processors that employ register recovery mechanismsafter flush operations.

Modern information handling systems (IHSs) may track dependenciesbetween instructions by using reservation station (RS) and recoverybuffer units (RBU)s. Because out-of-sequence instruction handling iscommon in modern IHSs, processors typically track the dependenciesbetween younger instructions and older instructions. If youngerinstructions depend on the results of older instructions, those youngerinstructions cannot complete until the results for the olderinstructions are available. During instruction dispatch, a generalpurpose register (GPR) may provide register A/register B (RA/RB) operandinformation that the reservation station (RS) receives. The recoverybuffer unit (RBU) receives target register (RT) information from theGPR. The GPR may include all information that younger instructions needto track in case a younger instruction is dependent upon an olderinstructions execution results.

In the case of a flush operation, such as instruction branch flush, theIHS typically will recover to a state prior to the flush operation. Theprocessor reads the RBU and moves data to the GPR to restore the GPR toa state or point prior to the flush. The instruction dispatch processmay not resume until the GPR completely recovers from the flush event.In the case where multiple instructions write to the same targetregister (RT) location in the GPR, the recovery buffer unit (RBU) willcontain multiple entries that reference the same target register (RT)location. During a flush there will be multiple RBU entries to read outof the recovery buffer unit (RBU) to restore a particular targetregister (RT) location with the proper data prior to the flush.

BRIEF SUMMARY

Accordingly, in one embodiment, a method of processing instructions isdisclosed. The method includes fetching, by a fetch unit, a plurality ofinstructions from an instruction source. The method also includesdecoding the plurality of instructions, by a decoder/dispatcher, theplurality of instructions including instructions that reference a targetregister of a general purpose register for storage of results thereof,the instructions that reference the target register including a currentinstruction and previous instructions. The method further includesdispatching, by a dispatcher, the plurality of instructions to an issuequeue. The method still further includes issuing, by the issue queue,the plurality of instructions that reference the particular targetregister out-of-order to execution units.

The method also includes storing a respective entry in a recovery bufferunit for each of the previous instructions and the current instruction,the previous instructions and the current instruction referencing thesame target register. The method further includes executing, by theexecution units, the current instruction and the previous instructions.The method also includes determining, by a branch execution unit, if anexception occurred for one of the previous instructions that require anissue queue flush. The method further includes performing an issue queueflush if an exception occurred in the determining step. The method alsoincludes performing, in response to the exception, a recovery bufferunit recovery to determine an oldest valid entry of the recovery bufferunit. The method further includes restoring the general purpose registerto a known good state using a result of a target register correspondingto the oldest valid entry of the recovery buffer unit determined in therecovery buffer unit recovery.

In another embodiment, a processor is disclosed that includes a fetchunit that fetches a plurality of instructions from an instructionsource. The processor also includes a decoder/dispatcher that decodesthe plurality of instructions, the plurality of instructions includinginstructions that reference a target register of a general purposeregister for storage of results thereof, the instructions that referencethe target register including a current instruction and previousinstructions. The decoder/dispatcher dispatches the plurality ofinstructions to an issue queue that is coupled to thedecoder/dispatcher. The processor still further includes a generalpurpose register, coupled to the decoder/dispatcher that stores a stateof the processor. The processor also includes a plurality of executionunits, coupled to the issue queue, wherein the issue queue issues theplurality of instructions that reference the particular target registerout-of-order to the execution units.

The processor further includes a recovery buffer unit, coupled to thegeneral purpose register, that stores a respective entry for each of theprevious instructions and the current instruction, the previousinstructions and the current instruction referencing the same targetregister, wherein the execution units speculatively execute the currentinstruction and the previous instructions. The processor also includes abranch execution unit, coupled to the issue queue, that determines if anexception occurred for one of the previous instructions that requires anissue queue flush, wherein the issue queue performs an issue queue flushif the exception occurred. The recovery buffer unit, in response to theissue queue flush, performs a recovery buffer unit recovery to determinean oldest valid entry of the recovery buffer unit, the recovery bufferunit restoring the general purpose register to a known good state usinga result of a target register corresponding to the oldest valid entry ofthe recovery buffer unit.

BRIEF DESCRIPTION OF THE DRAWINGS

The appended drawings illustrate only exemplary embodiments of theinvention and therefore do not limit its scope because the inventiveconcepts lend themselves to other equally effective embodiments.

FIG. 1 is block diagram of an information handling system (IHS) thatincludes the disclosed processor register recovery after flushmethodology.

FIG. 2 is a block diagram showing more detail of the processor thatemploys the register recovery after flush methodology.

FIGS. 3A-3B show a flowchart that shows process flow in the processor ofFIG. 2 as it employs the disclosed register recovery after flush method.

FIGS. 4A-4B is a flowchart that shows process flow in the processor ofFIG. 2 as it employs the disclosed reservation station instruction issuemethod.

FIGS. 5A-5B show a flowchart that shows process flow in the processor ofFIG. 2 as it employs the disclosed flush operation method.

FIGS. 6A-6B is a flowchart that shows process flow in the processor ofFIG. 2 as it employs the disclosed stall penalty reduction method.

DETAILED DESCRIPTION

Modern processors often use operand reservation stations and a recoverybuffer unit to manage general purpose register (GPR) data recovery aftera flush. An exception, such as a branch misprediction, may necessitate aflush operation. As part of the flush operation, a processor may analyzeall of the instructions of a recovery buffer unit. More particularly, inthis flush operation, the processor may analyze all of the instructionof the recovery buffer unit, from the youngest instruction entry to theoldest instruction entry, to restore the general purpose register (GPR)to a known good data state just prior to the flush. The processor mayconsume a substantial amount of time to perform this analysis ofinstructions. GPR recovery time affects processor latency.

In one embodiment, the disclosed processor may employ a recovery bufferunit that stores pointers to instructions that correspond to entries ofthe recovery buffer unit. The recovery buffer unit may store instructioninformation, such as the target register (RT) data into which aparticular instruction writes results. The processor may quickly analyzethe pointers and target register data to determine an oldest targetregister (RT) that corresponds to the last known good state of the GPRprior to the exception and flush.

FIG. 1 shows an information handling system (IHS) 100 that may employthe disclosed register recovery after flush method. IHS 100 includes aprocessor 200 that couples to a bus 110. A memory controller 115 couplesto bus 110. A memory bus 120 couples system memory 125 to memorycontroller 115. A video graphics controller 130 couples display 135 tobus 110. IHS 100 includes nonvolatile storage 140, such as a hard diskdrive, CD drive, DVD drive, or other nonvolatile storage that couples tobus 110 to provide IHS 100 with permanent storage of information.Nonvolatile storage 140 is a form of data store. I/O devices 150, suchas a keyboard and a mouse pointing device, couple via an I/O bus 155 andan I/O controller 160 to bus 110.

One or more expansion busses 165, such as USB, IEEE 1394 bus, ATA, SATA,eSATA, PCI, PCIE and other busses, couple to bus 110 to facilitate theconnection of peripherals and devices to IHS 100. A network interface170 couples to bus 110 to enable IHS 100 to connect by wire orwirelessly to other network devices. IHS 100 may take many forms. Forexample, IHS 100 may take the form of a desktop, server, portable,laptop, notebook, or other form factor computer or data processingsystem. IHS 100 may also take other form factors such as a personaldigital assistant (PDA), a gaming device, a portable telephone device, acommunication device or other devices that include a processor andmemory.

FIG. 2 shows a processor 200 that may employ the disclosed registerrecovery after flush method. In that case, processor 200 performs thefunctional blocks of the flowchart of FIGS. 3A-3B described below thatapply to the register recovery process. Processor 200 includes a cachememory 205 that may receive processor instructions from such as systemmemory 125, non-volatile storage 140, expansion bus 165, networkinterface 170, or other sources not shown in FIG. 2. Cache memory 205couples to a fetch unit 207 that processor 200 employs to fetch multipleinstructions from cache memory 205. Instructions may be in the form ofan instruction stream that includes a series or sequence of processorprogram instructions. Fetch unit 207 couples to an instruction cache 210to temporarily store the instruction stream for further processing byprocessor 200. Instruction cache 210 couples to a decoder/dispatcher 215that provides decoding and dispatching of instructions as the resourcesof processor 200 become available.

Decoder/dispatcher 215 couples to an issue unit/issue queue 220 thatissues instructions for execution to execution units. Issue unit/issuequeue 220 employs multiple execution units (EU)s for execution ofinstructions in processor 200. Issue unit/issue queue 220 couples to EU221 that includes multiple execution units. EU 221 includes branchexecution unit (BEU) 222, integer execution unit IEU 290, floating pointexecution unit (FPEU) 295, and other EUs not shown. Multiple EUs mayprovide processor 200 with multi-processing or parallel instructionprocessing capability.

EU 221 may provide instruction tag (ITAG) information in the form of abroadcast or communication to other units of processor 200. As shown bythe “ITAG DATA” arrow from EU 221 to units within processor 200, EU 221provides instruction tag or ITAG data in the form of broadcastcommunications. For example, execution units of EU 221, namely IEU 290,and FPEU 295 broadcast an execution ITAG or EXEC ITAG 223 to GPR 230, RS270, RBU 240, and BEU 222. In other words, IEU 290, and FPEU 295 ofprocessor 200 broadcast execution ITAG and other results to GPR 230,reservation station RS 270, recovery buffer unit RBU 240, and BEU 222.The BEU 222 is responsible for instruction completion and providingcompletion instruction tag COMP ITAG 224 and flush instruction tag orFLUSH ITAG 225 data to GPR 230. In one embodiment, BEU 222 broadcastsCOMP ITAG 224 and FLUSH ITAG 225 data to GPR 230, RS 270, RBU 240.

An instruction tag register or ITAG stores pointer address data for aparticular instruction. For example, decoder/dispatcher 215 may dispatchthe particular instruction that fetch unit 207 reads from IHS 100memory. The particular instruction may reside in system memory 125 orother memory of IHS 100 during processor 200 fetch operations. The ITAGdata, such as COMP ITAG 224 is a register that stores pointerinformation that may provide processor 200 with the particularinstruction information store. The EXEC ITAG 223 is a register that maystore the currently executing ITAG information for processor 200,whereas COMP ITAG 224 is a register that may store the last completeinstruction ITAG data.

FLUSH ITAG 225 may store the last instruction ITAG prior to a flushoperation in processor 200. Instructions may not always execute in thesame order that they dispatch from decoder/dispatcher 215 or issue fromissue unit/issue queue 220. It is common for instructions to execute“out of order”. This out of order execution provides challenges,particularly when processor 200 flushes instruction cache 210 for anyreason. One such reason for a flush may be a look-a-head branchmisprediction fault and subsequent instruction cache 210 flush. Arecovery pending unit (RPU) 226 couples to decoder/dispatcher 215. Theoutput of decoder/dispatcher 215 couples to RPU 226 to provide registerinformation thereto. RPU 226 may include RA and RB registers (not shown)for storage of RA/RB operand information and other informationpertaining to processor 200 instructions. The output ofdecoder/dispatcher 215 also couples to a general purpose register (GPR)230.

During instruction dispatch, namely operation of decoder/dispatcher 215,instruction register information stores in RPU 226 and GPR 230. Forexample, an executing instruction may write data into a particularregister of GPR 230. GPR 230 may store multiple registers such as aregister A (RA) 231, a register B (RB) 232, or a target register (RT)233 that the output of a particular instruction writes into. An examplemay be an instruction that requires the addition of the contents ofregisters RA 231 and RB 232 (i.e. RA 231+RB 232) and places the resultin target register RT 233. RT 233 may store an entry for eachinstruction that writes data thereto. GPR 230 also includes registers W234, RT address field RTAF 237, ITAG 235 and V 236 to store instructioninformation specific to a particular instruction during dispatch.

During an instruction, dispatch processor 200 stores an entry write bit(W) 234, RTAF 237 data, an instruction tag (ITAG) 235, and a valid bit(V) 236 in general purpose register (GPR) 230. In this document, the bitthat a particular register stores may also identify the register thatstores that bit. For example, W register 234 stores an entry write bitW. The RTAF 237 data identifies the particular register in GPR 230 thatreceives current instruction results. ITAG register 235 stores aninstruction tag (or ITAG) and V register 236 stores a valid bit (V). Theentry write bit W and valid bit V information are specific to theparticular instruction currently dispatching. In other words, W 234 andV 236 represent entry write bit data and valid bit data, respectively,for the particular ITAG 235 instruction pointer. A pointer is an addressof an instruction as opposed to the instruction itself. In one example,a W 234 bit=1 indicates that an instruction data result resides in GPR230. In contrast, a W 234 bit=0 indicates that instruction data does notreside in GPR 230, or that the instruction is not yet complete.

To support a register recovery from a flush, processor 200 stores RA andRB register information in RPU 226 during the same instruction dispatchoperation described above. RPU 226 may store multiple instruction RA/RBdata (not shown) to support recovery of GPR 230 to a particular state ofprocessor 200 following a flush operation. ITAG 235 stores aninstruction pointer to the instruction or opcode that writes to RT 233.For purposes of this example, GPR 230 may represent one particular stateof processor 200, and more particularly, information for the currentinstruction.

Processor 200 employs a recovery buffer unit (RBU) 240 that providesdata for use after a flush. GPR 230 couples to RBU 240 to provideregister bit information on a per instruction basis. For example, GPR230 may write register bit information to RBU 240 during instructiondispatch and RBU 240 may write register bit information to GPR 230during flush recovery. Recovery buffer unit (RBU) 240 includes a groupof registers, namely RT 241, W 242, ITAG 243 and V 244. RBU 240 alsoincludes a previous instruction tag (PITAG) register 245, a GPR DATA248, RT address field RTAF 249, and a valid bit (V) register 246.Registers W 242, ITAG 243 and V 244 store instruction information aboutthe newest or younger instruction of instructions dispatching inprocessor 200. Registers PITAG 245, RT 241, and V 246 store instructioninformation for the previous or older instruction of instructionsdispatching in processor 200. PITAG 245 includes instruction pointerinformation about an instruction previous to the ITAG 243 instructionpointer information. RT 241 points to GPR 230 entry data that RBU 240restores during a flush operation.

Multiple instructions that decoder/dispatcher 215 dispatches may eachseek to update a particular register location in GPR 230, such as RT233. During instruction dispatch and execution, multiple temporary RTdata, namely RT 241 entries may reside in RBU 240 or other memory of IHS100 and upon completion of multiple instructions resolve to a single RT233 entry. In one embodiment, during a flush, only one of thosetemporary RT 241 entries is valid. One embodiment of the disclosedmethodology describes a process to identify one entry of RT 241 frommultiple RT entries in recovery buffer unit 240 to recover after aflush. During dispatch, a particular instruction may write informationinto RT 233 of GPR 230. During that same process, ITAG 235 informationwill also be written to provide a pointer to that particularinstruction. If a younger instruction is reading the RT 233 data, theyounger instruction will also read the ITAG 235 data and wait for thatITAG 235 instruction to complete and produce results before the youngerinstruction completes.

Recovery buffer unit (RBU) 240 includes a recovery valid (RV) registerfor RV bit 247. The RV bit of RV 247 provides information that processor200 uses to determine which RBU entries support the recovery of GPR 230after a flush. GPR 230 includes a recovery pending bit RP 238. RP 238 isa bit or flag that corresponds to a particular entry in GPR 230. RP 238may indicate that the particular entry in GPR 230 requires an RBUrecovery operation in response to the instruction queue flush operation.Processor 200 interprets the data of RBU 240 and using RV 247, and otherbit data, determines that RBU entry to supply data for GPR 230 recovery.

During dispatch, when a particular current instruction accesses RT 233,processor 200 reads RT 233, GPR 230 data at that RT 233 location, W 234,ITAG 235, and the V 236 register and stores that data into RBU 240.Processor 200 also stores the previous instruction's ITAG information inthe PITAG 245 register of recovery buffer unit RBU 240, along with V 246register data. In this manner, RBU 240 stores the original or previousresult data at location RT 241 from the previous instruction PITAG 245and the previous ITAG information PITAG 245. RBU 240 also stores theparticular current or younger instruction data at location RT 241 fromthe current instruction pointer ITAG 243.

RBU 240 includes multiple registers that each store multiple dataentries for instructions that write to target register RT 233. RegistersITAG 243 and PITAG 245 of RBU 240 each couple to a flush compare unit250 to provide current and previous instruction pointer informationthereto. Flush compare unit 250 couples to recovery valid (RV) register247 to provide recovery validity register data from comparison of ITAG243 and PITAG 245 information. When a flush occurs, flush compare unit250 compares ITAG 243 and PITAG 245 data.

If ITAG 243 is the same or younger than the flush ITAG 225, and PITAG245 is also the same or younger than flush ITAG 243, processor 200determines that the RBU 240 data will not require recovery after theflush. If ITAG 243 is the same or younger than flush ITAG 243, and PITAG245 is older than flush ITAG 225, the processor 200 determines that theRBU 240 will require recovery after the flush. However, if both ITAG 243and PITAG 245 are older than flush ITAG 225, the RBU 240 data will notrequire recovery after the flush operation. RBU 240 stores one entry perinstruction, thus RV 247 stores one recovery valid bit per instruction.

The RV 247 bit is set =0 at flush time for any entries that processor200 does not require for recovery. RV 247 is set =1 at flush time forany entries that processor 200 requires for recovery. If RV 247 bit=1,processor 200 reads instruction register data from RBU 240 and writesthat data into GPR 230 registers. Processor 200 may perform a compare ofall RT and ITAG entry data in RBU 240. In other words, in oneembodiment, flush compare unit 250 may compare all instructions thatperform a write to RT 241 and flush only those instructions thatspecifically update RT 241. In this manner, only the oldest RT entrythat processor 200 does not flush will need recovery in RBU 240.Recovery buffer unit RBU 240 may be read from youngest instruction tooldest, or from oldest instruction to youngest order during recovery.

ITAG 235 of GPR 230 couples to a flush compare unit 260. Flush compareunit 260 couples to RPU 226 to provide flush status for GPR 230. At thestart of a flush, flush compare unit 260 sends a flush signal to RPU 226to instruct RPU 226 that GPR 230 is now recovering from a flush. In oneembodiment, any instructions of processor 200 that exhibit dependenciesto GPR 230 wait until the recovery process completes before completingexecution. After the GPR 230 register recovery process completes,instructions with dependencies to GPR 230 may complete execution aswell.

In one embodiment, GPR 230 provides register RA 231 and RB 232information to operand reservation station (RS) 270 to provide registerRA 272 and register RB 274 data. Registers W 234 and ITAG 235 couplerespectively to a W register 275 and an ITAG register 277 to provideinstruction register information for temporary storage and analysisduring flush recovery. Reservation station (RS) 270 couples to anoperands 280 register to provide RA and RB register data that eachoriginate from registers RA 231 and RB 232 of GPR 230. Operands register280 couples to EU 221 and provides RA 231 and RB 232 data to BEU 222, aninteger execution unit (IEU) 290, and a floating point execution unit(FPEU) 295. W register 275 and ITAG register 277 each couple to acompare unit 285. Compare unit 285 couples to RS 270 to provide compareresults of instruction ITAG information during multiple instructiondispatch, decode, issue and execution operations of processor 200. EU221 couples to compare unit 285, RS 270, GPR 230, and RBU 240. Issueunit 220 couples to EU 221, including BEU 222, IEU 290, and FPEU 295.BEU 222, IEU 290 and FPEU 295 include the group of execution units forprocessor 200.

FIGS. 3A-3B show a flowchart that describes one example of the disclosedregister recovery after flush method. The register recovery after flushmethod starts, as per block 305. Processor 200 of IHS 100 allocatesrecovery buffer unit (RBU) 240 entry memory, as per block 310. Thatallocation provides register memory space for a dispatching instructionfor each register entry for RBU 240, such as those of registers RT 241,W 242, ITAG 243, V 244, PITAG 245, V 246 and RV 247. Processor 200allocates reservation station (RS) 270 entry memory, as per block 315.That allocation provides memory space for the dispatching instruction inreservation station RS 270. Decoder/dispatcher 215 dispatches aparticular instruction in a sequence of instructions from instructioncache 210, as per block 318. Processor 200 reads RA 231, RB 232, W 234,ITAG 235, and V 236 data from GPR 230 for the particular dispatchinginstruction, as per block 320.

Processor 200 writes RA 272, RB 274, W 275, and ITAG 277 data intoreservation station (RS) 270, as per block 325. Processor 200 reads theV 236 valid bit data from GPR 230 to determine if that bit is set to 1,as per block 370. If V 236 is set to 1, processor 200 reads the W 234bit from GPR 230 to determine if that bit is set to 1, as per block 330.If the GPR 230 V 236 bit is not equal to one, then processor 200 sets Wbit=1 in issue unit/issue queue 220 for the currently issuinginstruction, as per block 335. If the GPR 230 W 234 is not =1, processor200 sets the W bit=0 in issue unit/issue queue 220 for the currentlyissuing instruction, as per block 338. If GPR 230 V 236 bit is =1 andGPR 230 W 234 bit=1, then processor 200 sets the W bit=1 in issueunit/issue queue 220, as per block 335. After dispatching theinstruction at block 318, processor 200 reads RT 231, RB 232, W 234,ITAG 235, and V 236 data from GPR 230 for the previous instruction, asper block 340.

Processor 200 writes RBU 240 data, namely RT 241, W 242, ITAG 243, V244, RTAF 249, and GPR DATA 248 from previous instruction data, as perblock 345. Processor 200 writes GPR 230 ITAG 235 data into RBU 240 PITAG245 and V 246 registers as previous instruction information, as perblock 350. Processor 200 performs a test to see if GPR 230 V 236=1, asper block 355. If the V 236 bit of GPR 230 for the dispatchinginstruction is equal to 1, then V 246 of RBU 240 is set to 1, as perblock 360. If the V 236 bit of GPR 230 for the dispatching instructionis not equal to 1, then V 246 of RBU 240 is set to 0, as per block 365.Once the V 246 bit is set to either 1 or 0, processor 200 writes thedispatching instruction information into RBU 240, (RT 241, W 242, ITAG243, V 244, and GPR DATA 248), as per block 370. Processor 200 alsowrites the V 244 bit=1, as per block 370.

Processor 200 sets V 244=1 in RBU 240, as per block 375. After the GPR230 read per block 340 above, processor 200 writes dispatchinginstruction ITAG information, into GPR 230 ITAG 235 register and sets W234=0, and V 236=1, as per block 380. Processor 200 performs a test todetermine if dispatch is complete, as per block 380. If dispatch is notcomplete, processor 200 performs an instruction dispatch, as per block318. However, is dispatch is complete, the disclosed register recoveryafter flush method ends, as per block 390.

FIGS. 4A-4B show a flowchart that describes one example of the disclosedreservation station instruction issue method. The reservation stationinstruction issue method starts, as per block 405. Processor 200 setsissue unit/issue queue 220 W bit, RBU 240 V 244 bit, and GPR 230 W 234and V 236 bits, as per block 410. Processor 200 sets the register bitsof issue unit/issue queue 220, RBU 240, and GPR 230 following a registerrecovery operation, such as described in FIGS. 3A-3B above. Processor200 examines each GPR 230 W 234 bit simultaneously, as per block 415.For example, each instruction operand includes a W 234 bit for RA 231and a W 234 bit for RB 232.

Processor 200 performs a test to determine if W 234=1 for allinstruction operands of GPR 230, as per block 420. When both of these Wbits, namely RA 231 and RB 232 are equal to 1 then the instruction isready to issue for execution. If all issue unit/issue queue 220instructions operands do not equal 1, processor 200 examines the nextinspection, as per block 425. If all W 234 bits of any particularinstruction are =1, then that particular instruction is ready for issue.Processor 200 selects the next oldest ready instruction for issue toEUs, such as BEU 222, IEU 290, and FPEU 295. Processor 200 executes thenext oldest instruction to EU 221, as per block 430. EU 221 broadcastsfinish execution ITAG to GPR 230, RBU 240, RS 270, and BEU 222 as perblock 435.

Following instruction execution, compare unit 285 compares executionITAG information for executing instruction EXEC ITAG 223 with ITAG 277of RS 270, as per block 440. Compare unit 285 compares EXEC ITAG 223 ofexecuting instruction with ITAG 243 and PITAG 245 of RBU 240, as perblock 445. In one embodiment, BEU 222 examines the finishing instructionto determine if that finishing instruction is the oldest instruction forcompletion. If the finishing instruction is eligible for completion, BEU222 completes the instruction and sends the completion ITAG to GRP 230and RBU 240.

After the next instruction examination, as shown in block 425, processor200 performs a test to determine if the EXEC ITAG 223 and RS 270 ITAGSmatch, as per block 448. If the ITAG match is true, processor 200 writesthe results of EU 221 into RS 270, as per block 450. Processor 200 setsW 275=1 in RS 270, as per block 452. However, if the ITAG match test isnot true, processor 200 performs a test to determine if the currentinstruction is ready to complete, as per block 455. Following the ITAGcompare per block 440, processor 200 writes results instructionexecution results into GPR 230 using RTAF 237 as the index address intoGPR 230, as per block 458. The W 234 bit in the corresponding RT 233location is set =1, as per block 460. In that manner, the GPR 230instruction execution results location is available for youngerdispatching instruction use.

Processor 200 performs a test to determine if EXEC ITAG 223 and RBU 240ITAGS match, as per block 462. If the ITAG match is true, processor 200writes the results of EU 221 to RBU 240, as per block 464. Processor 200sets W 242=1 in RBU 240, as per block 465. If the instruction is readyto complete, BEU 222 broadcasts completion ITAG data to GPR 230 and RBU240, as per block 460. However, if the instruction is not ready tocomplete, processor 200 continues testing for instruction completion, asper block 455. EU 221 broadcasts completion COMP ITAG 224 data to GRP230 and RBU 240, as per block 460. Compare unit 285 performs a test todetermine if COMP ITAG 224 from BEU 222 compares with GPR 230 ITAG 235,as per block 462. If COMP ITAG 224 compares with ITAG 235, processor 200resets both W 234=0, and V 236=0 of GPR 230, as per block 465.

Compare unit 285 performs a test to determine if COMP ITAG 224 from BEU222 compares with RBU 240 PITAG 245, as per block 470. If COMP ITAG 224compares with PITAG 245, processor 200 resets both W 242=0, and V 244=0of RBU 240, as per block 475. However if the COMP ITAG 224 from BEU 222does not compare with RBU 240 PITAG 245, or following the reset of W 242and V 244 bits, the disclosed reservation station instruction issuemethod ends, as per block 480.

FIGS. 5A-5B show a flowchart that describes one example of the disclosedflush operation method. The flush operation method starts, as per block505. Processor 200 performs a test to determine if a execute branchflush or exception fault operation completed, as per block 510. Ifneither operation completed, processor 200 sends FLUSH ITAG to RBU 240,as per block 512. Processor 200 compares FLUSH ITAG 225 to RBU 240 PITAG245 and RBU 240 ITAG 243, as per block 515. Processor 200 performs atest to determine if ITAG 243 is the same as or younger than FLUSH ITAG225, as per block 520. In other words, processor 200 determines which ofeach instruction of ITAG 243 and FLUSH ITAG 225 is older, younger or thesame.

If processor 200 determines that ITAG 243 is the same as or younger thanFLUSH ITAG 225, then processor 200 resets V 244 of RBU 240=0 and flowcontinues, as per block 522. Processor 200 may initiate a branch flushoperation and exception fault operation. BEU 222 will monitor suchoperations and may initiate a flush request. During a flush operation,BEU 222 sends a FLUSH ITAG 225 to RPU 226, GPR 230, RBU 240, and RS 270.FLUSH ITAG 225 initiates or communicates a flush operation to RBU 240and other units of processor 200, such as flush compare unit 260. RS 270compares FLUSH ITAG 225 to all RS 270 ITAGs. If any ITAGs match, asdescribed below, the valid V 246 bit for that entry is reset =0 toindicate that the flush operation for RS 270 is complete. In thatmanner, RS 270 is available for younger dispatching instructions.

Processor 200 performs a test to determine if PITAG 245 is the same asor younger than FLUSH ITAG 225, as per block 525. In other words,processor 200 determines which of each instruction of ITAG 243 and FLUSHITAG 225 is older, younger or the same. If processor 200 determines thatPITAG 245 is the same as or younger than FLUSH ITAG 225, then processor200 resets V 246 of RBU 240=0, and flow continues, as per block 525.However, if the ITAG 243 or PITAG 245 tests per blocks 350 and 525indicate that the ITAG is not the same or younger than FLUSH ITAG 225,processor 200 examines PITAG V 246 and ITAG V 244 data of RBU 240, asper block 530.

Processor 200 performs a test to determine the values of PITAG 245 V 246and ITAG 243 V 244. If PITAG 245 V 246=1 and ITAG 243 V 244=1 is true,as per block 535, then the executing instruction is not a flushinstruction. In that case, the executing instruction will not require arecovery operation. If PITAG 245 V 244=0 and ITAG 243 V 244=0 is true,as per block 540, then the executing instruction is a flush instructionand the previous instruction is not a flush instruction. In other words,for this case the PITAG 245 is the oldest surviving ITAG after the flushand thus requires a recovery operation.

When PITAG 245 V 246=1 and ITAG 243 V 244=0 is true, as per block 545,the RV bit is set to 1 at this location to indicate that a recoveryoperation will restore GPR 230 from this location, as per block 548.Processor 200 performs a test to determine if all PITAG 245 and ITAG 243entries are examined, as per block 550. If all entries are not examined,or if the PITAG 245 and ITAG 243 tests per block 535, and 540 are true,then processor 200 examines the entry data, as per block 555. Processor200 examines PITAG 245 V 246 and ITAG 243 V 244 data of RBU 240, as perblock 530. If processor 200 examines all entries, or if the test perblock 545 is not true, processor 200 reads the next RV 247 bit of RBU240, as per block 560.

Processor 200 examines all entries in RBU 240 at flush time. Afterprocessor 200 examines all RBU 240 entries, each RBU 240 RV 247 bits areset to represent a flush event, as described above. Processor 200initiates the recovery operation to restore GPR 230 entries to the pointprior to the flush operation. An RV 247 value of 1 indicates that theRBU 240 instruction information or entry requires recovery from theflush operation. Processor 200 performs a test to determine if RBU 240RV 247 bit=1, as per block 565. If RV 247 is not equal to 1, processor200 reads the next RV 247 bit of RBU 240 and flow continues, as perblock 560. However, if RV 247=1 is true, processor 200 reads PITAG 245,W 242, RT 214, and GPR DATA 248 from RBU 240, as per block 570.

Processor 200 writes restored GPR DATA 248, W 234, ITAG 235, and V 236data into GPR 230, using RT 214 of RBU 240 as write address, as perblock 572. Processor 200 uses the data of PITAG 245 and V 246 of RBU 240as input into the write of ITAG 235 and V 236 of GPR 230. Processor 230resets RV 247=0 at recovery location of current instruction of RBU 240,as per block 575. Processor 200 moves to the next instruction of RBU 240by reading the next RBU 240 RV 247 bit again, as per block 560.Processor 200 performs a test to determine if all RV 247 bits of RBU 240for all ITAG instruction entries =0, as per block 580. If all RV 247bits=0 is true, processor 200 resumes dispatching instructions, as perblock 582. In other words, if all instruction ITAG entries of RBU 240demonstrate a value of RV 247=0, processor 200 resumesdecoder/dispatcher 215 operation. However, if all RV 247 bits=0 is nottrue, processor 200 stops decoder/dispatcher 215, as per block 585. Theflush operation method ends, as per block 590.

In another embodiment, a method is described below to reduce thedispatch stall penalty during the flush recovery process. While RBU 240is recovering from a flush, such as a branch flush, decoder/dispatcher215 may incur stall penalties or delays that may degrade the overallperformance of processor 200. Each ITAG 235 entry in GPR 230 maintains acorresponding recovery pending bit RP 238 in recovery buffer unit RBU240. The RP 238 bit may be set at flush time to indicate the particularGPR 230 entries that will need recovery after the flush. After aparticular GPR 230 ITAG 235 entry recovers from a flush, processor 200resets the recovering pending bit (RP) 238 that corresponds to thatentry.

If an instruction reads data from that particular GPR 230 entry, thecorresponding reset RP 238 bit informs the instruction that theparticular GPR 230 ITAG 235 entry is ready for dispatch. If theparticular GPR 230 entry is still recovering from a flush and is not yetcomplete, the corresponding RP 238 bit is set, or active. In that case,a dependent instruction will wait at decoder/dispatcher 215 until GPR230 dependent entries recover. In other words, the dependent instructionwaits or stalls until processor 200 writes restore data from RBU 240following the flush operation. The dependent instruction sitting at thedispatch stage reads the GPR 230 RP 238 bits by using the instruction RA231 and RB 232 operand address fields. If either the RP 238 bit for RA231 or RB 232 is =1, then the dependent instruction must wait at thedispatch stage until that GPR 230 location is recovers from RBU 240. Ifboth of the RP 238 bits for RA 231 and RB 232 of the instruction =0,then these GPR 230 locations are complete with recovery, or processor200 does not require the recovery operation. When both of the RA 231 andRB 232=0, then the dependent instruction is dispatches. After thedependent instruction dispatches, the next instruction from dispatchwill read GPR 230 for RP 238 data to determine if any RP 238 bit is setfor its RA 231 and RB 232 bits.

FIGS. 6A-6B show a flowchart that describes one example of the discloseddispatch stall penalty reduction method. The stall penalty reductionmethod starts, as per block 605. Processor 200 initiates a flushoperation in response to a branch misprediction fault, exception faultor other cause, as per block 610. In response to the flush operation,processor 200 sends FLUSH ITAG 225 from BEU 222 to GPR 230, as per block615. Processor 200 performs a test to determine if GPR 230 ITAG 235 isequal to or younger than FLUSH ITAG 225, as per block 620. If ITAG 235is equal to or younger than FLUSH ITAG 225, processor 200 sets recoverypending bit RP 238=1 in GPR 230, as per block 625.

If the GPR test per block 620 is not true, or after the RP 238 bit isset =1, processor 200 reads the next RBU 240 RV 247 bit, as per block630. Processor 200 performs a test on the RV 247 bit to determine if RV247=1, as per block 635. If RV 247 is not equal to 1, processor 200reads the next RBU 240 RV 247 bit, as per block 630. However, if thetest of RV 247=1 is true, processor 200 reads RT 214, W 242, and PITAG245 data from RBU 240, as per block 640. Processor 200 writes W 234,ITAG 235, and V 236 data into GPR 230, using RT 214 of RBU 240 as writeaddress, as per block 645.

Processor 200 uses the data of PITAG 245 and V 246 of RBU 240 as inputinto the write of ITAG 235 and V 236 of GPR 230. Processor 230 resets RP247=0 in at corresponding instruction recovery location of GPR 230, asper block 650. Processor 230 resets RV 247=0 at recovery instructionlocation in RBU 240, as per block 655. Processor 200 moves to the nextinstruction of RBU 240 by reading the next RBU 240 RV 247 bit, as perblock 658. Processor 200 performs a test to determine if all RV 247bits=0 for all ITAG instruction entries of RBU 240, as per block 660.

If the recovery operations stops, all RV 247 bits=0, and processor 200performs a dispatch stop, as per block 665. Processor 200 reads RP 238RA 231 and RB 238 bits, as per block 670. In other words, while therecovery process executes, processor 200 checks the RP 238 bits for itsRA 231 and RB 232 data. Processor 200 performs a test to determine ifeither of the RP 238 bits for its RA 231 or RB 232 is =1, as per block675. If RA 231 or RB 232 are =1, then the instruction stalls atdispatch, as per block 680. The instruction stalls at dispatch becausethe RA 231 or RB 232 location is still waiting for recovery. If both ofthe RP 238 bits for the instruction sitting at the dispatch stage are=0, then these GPR 230 locations are not waiting for recovery, and theinstruction is ready for dispatch. The next dispatching instruction willcheck its RP 238 bits for both RA 231 and RB 232 to determine if it isdispatch ready or not. In this manner, the flush operation will allowinstructions to continue dispatching while the recovery process isexecuting. Processor 200 dispatches the next instruction, as per block685. The stall penalty reduction method ends, as per block 690.

The foregoing discloses methodologies wherein a processor may employregister recovery operations after an instruction queue flush that anexception such as a branch misprediction causes. A processor within anIHS may employ ITAG and previous ITAG PITAG registers along withrecovery pending RP, recovery valid RV, and other registers as temporarystorage for instruction flush data. In one embodiment, the processorperforms a recovery buffer unit recovery in response to, or as part of,an issue queue flush. In one embodiment, the recovery buffer unit storespointers to a current instruction entry and previous instructionentries. The processor may quickly analyze these recovery buffer unitpointers to quickly determine an oldest entry of the recovery bufferunit that does not require recovery. The processor may use this oldestentry of the recovery buffer unit to restore the state of a generalpurpose register to a known good state in response to an issue queueflush.

The foregoing also discloses a methodology that may reduce dispatchstall delay after the start of an issue queue flush. A processor withinan IHS may perform general purpose register recovery operations after aninstruction flush operation that an exception, such as an instructionbranch misprediction or other event, causes. The processor recovers thegeneral purpose register to a useable state that existed prior to theflush operation. During the flush operation the decoder/dispatcher maystall the dispatch of an instruction until the flush operationcompletes. In one embodiment, the decoder/dispatcher dispatchesinstructions that a recovery buffer unit indicates as valid instructionsduring the flush operation. The recovery buffer uses recovery pendingbit data at the GPR, recovery valid bits at the recovery buffer, andother register data to determine the validity of dispatchinginstructions. If a dispatching instruction is valid, that instructiondispatches prior to the flush operation completion. In this manner,during the flush operation, instructions continue to dispatch withoutcausing a dispatcher stall event.

Modifications and alternative embodiments of this invention will beapparent to those skilled in the art in view of this description of theinvention. Accordingly, this description teaches those skilled in theart the manner of carrying out the invention and is intended to beconstrued as illustrative only. The forms of the invention shown anddescribed constitute the present embodiments. Persons skilled in the artmay make various changes in the shape, size and arrangement of parts.For example, persons skilled in the art may substitute equivalentelements for the elements illustrated and described here. Moreover,persons skilled in the art after having the benefit of this descriptionof the invention may use certain features of the invention independentlyof the use of other features, without departing from the scope of theinvention.

1. A method of processing instructions, comprising fetching, by a fetchunit, a plurality of instructions from an instruction source; decodingthe plurality of instructions, by a decoder/dispatcher, the plurality ofinstructions including instructions that reference a target register ofa general purpose register for storage of results thereof, theinstructions that reference the target register including a currentinstruction and previous instructions; dispatching, by thedecoder/dispatcher, the plurality of instructions to an issue queue,issuing, by the issue queue, the plurality of instructions thatreference the particular target register out-of-order to executionunits; storing a respective entry in a recovery buffer unit for each ofthe previous instructions and the current instruction, the previousinstructions and the current instruction referencing the same targetregister; speculatively executing, by the execution units, the previousinstructions and the current instruction; determining, by a branchexecution unit, if an exception occurred for one of the previousinstructions that requires an issue queue flush; performing an issuequeue flush if an exception occurred in the determining step;performing, in response to the exception, a recovery buffer unitrecovery to determine an oldest valid entry of the recovery buffer unit;and restoring the general purpose register to a known good state using aresult of a target register corresponding to the oldest valid entry ofthe recovery buffer unit determined in the recovery buffer unitrecovery.
 2. The method of claim 1, wherein the performing of therecovery buffer unit recovery is in response to flushing ofspeculatively executed instructions after a branch misprediction.
 3. Themethod of claim 1, wherein each recovery buffer unit entry includes afirst pointer to a first instruction and a second pointer to a secondinstruction immediately following the first instruction.
 4. The methodof claim 3, wherein the first pointer is a previous instruction and thesecond pointer is a current instruction.
 5. The method of claim 4,wherein the first pointer is a PITAG address and the second pointer isan ITAG address.
 6. The method of claim 4, further comprising storing arespective recovery valid bit per entry in the recovery buffer unit. 7.The method of claim 6, further comprising determining, by the recoverybuffer unit, from the first and second pointers and recovery bits of theentries of the recovery buffer unit an oldest entry of the recoverybuffer unit that does not require recovery in response to the exception.8. The method of claim 7, further comprising writing to the generalpurpose register with information from the oldest entry of the recoverybuffer unit that does not require recovery in response to the exception,thus restoring the general purpose register to a known good state.
 9. Aprocessor, comprising: a fetch unit that fetches a plurality ofinstructions from an instruction source; a decoder/dispatcher thatdecodes the plurality of instructions, the plurality of instructionsincluding instructions that reference a target register of a generalpurpose register for storage of results thereof, the instructions thatreference the target register including a current instruction andprevious instructions, wherein the decoder/dispatcher dispatches theplurality of instructions to an issue queue that is coupled to thedispatcher; a general purpose register, coupled to the dispatcher, thatstores a state of the processor; a plurality of execution units, coupledto the issue queue, wherein the issue queue issues the plurality ofinstructions that reference the particular target register out-of-orderto the execution units; a recovery buffer unit, coupled to the generalpurpose register, that stores a respective entry for each of theprevious instructions and the current instruction, the previousinstructions and the current instruction referencing the same targetregister, wherein the execution units speculatively execute the previousinstructions and the current instruction; a branch execution unit,coupled to the issue queue, that determines if an exception occurred forone of the previous instructions that requires an issue queue flush,wherein the issue queue performs an issue queue flush if the exceptionoccurred; and wherein the recovery buffer unit, in response to the issuequeue flush, performs a recovery buffer unit recovery to determine anoldest valid entry of the recovery buffer unit, the recovery buffer unitrestoring the general purpose register to a known good state using theresult of a target register corresponding to the oldest valid entry ofthe recovery buffer unit.
 10. The processor of claim 9, wherein theprocessor performs a recovery buffer unit recovery is in response to theflushing of speculatively executed instructions after a branchmisprediction.
 11. The processor of claim 9, wherein each recoverybuffer unit entry includes a first pointer to a first instruction and asecond pointer to a second instruction immediately following the firstinstruction.
 12. The processor of claim 11, wherein the first pointer isa previous instruction and the second pointer is a current instruction.13. The processor of claim 12, wherein the first pointer is a PITAGaddress and the second pointer is an ITAG address.
 14. The processor ofclaim 12, wherein the processor stores a respective recovery valid bitper entry in the recovery buffer unit.
 15. The processor of claim 14,further comprising determining, by the recovery buffer unit, from thefirst and second pointers and recovery bits of the entries of therecovery buffer unit an oldest entry of the recovery buffer unit thatdoes not require recovery in response to the exception.
 16. Theprocessor of claim 15, wherein the recovery buffer unit writes into thegeneral purpose register information from the oldest entry of therecovery buffer unit that does not require recovery in response to theexception, thus restoring the general purpose register to a known goodstate.
 17. An information handling system (IHS), comprising: a memory; aprocessor, coupled to the memory, the processor including: a fetch unitthat fetches a plurality of instructions from an instruction source; adecoder/dispatcher that decodes the plurality of instructions, theplurality of instructions including instructions that reference a targetregister of a general purpose register for storage of results thereof,the instructions that reference the target register including a currentinstruction and previous instructions, wherein the decoder/dispatcherdispatches the plurality of instructions to an issue queue that iscoupled to the dispatcher; a general purpose register, coupled to thedispatcher, that stores a state of the processor; a plurality ofexecution units, coupled to the issue queue, wherein the issue queueissues the plurality of instructions that reference the particulartarget register out-of-order to the execution units; a recovery bufferunit, coupled to the general purpose register, that stores a respectiveentry for each of the previous instructions and the current instruction,the previous instructions and the current instruction referencing thesame target register, wherein the execution units speculatively executethe previous instructions and the current instruction; a branchexecution unit, coupled to the issue queue, that determines if anexception occurred for one of the previous instructions that requires anissue queue flush, wherein the issue queue performs an issue queue flushif the exception occurred; and wherein the recovery buffer unit, inresponse to the issue queue flush, performs a recovery buffer unitrecovery to determine an oldest valid entry of the recovery buffer unit,the recovery buffer unit restoring the general purpose register to aknown good state using the result of a target register corresponding tothe oldest valid entry of the recovery buffer unit.
 18. The IHS of claim17, wherein the IHS performs a recovery buffer unit recovery is inresponse to the flushing of speculatively executed instructions after abranch misprediction.
 19. The IHS of claim 17, wherein each recoverybuffer unit entry includes a first pointer to a first instruction and asecond pointer to a second instruction immediately following the firstinstruction.
 20. The IHS of claim 19, wherein the first pointer is aprevious instruction and the second pointer is a current instruction.21. The IHS of claim 20, wherein the first pointer is a PITAG addressand the second pointer is an ITAG address.
 22. The IHS of claim 20,wherein the processor stores a respective recovery valid bit per entryin the recovery buffer unit.
 23. The IHS of claim 22, further comprisingdetermining, by the recovery buffer unit, from the first and secondpointers and recovery bits of the entries of the recovery buffer unit anoldest entry of the recovery buffer unit that does not require recoveryin response to the exception.
 24. The IHS of claim 23, wherein therecovery buffer unit writes into the general purpose registerinformation from the oldest entry of the recovery buffer unit that doesnot require recovery in response to the exception, thus restoring thegeneral purpose register to a known good state.